# Dynamic High-Resolution Processing
InternViT-6B-448px-V2_5
MIT
InternViT-6B-448px-V2_5 is a major upgrade of InternViT-6B-448px-V1-5. ViT incremental learning with a next-token-prediction (NTP) loss improves its visual feature extraction, particularly in complex scenarios such as multilingual OCR data and mathematical charts.
Text-to-Image
OpenGVLab
711
36
InternViT-300M-448px-V2_5
MIT
InternViT-300M-448px-V2_5 is a major upgrade of InternViT-300M-448px. ViT incremental learning with a next-token-prediction (NTP) loss improves its visual feature extraction, particularly on multilingual OCR data and in complex scenarios such as mathematical charts.
Text-to-Image
OpenGVLab
23.29k
33